Abstract. We consider the following open question in the spirit of Ramsey
theory: Given an aperiodic infinite word $w$, does there exist a finite
coloring of its factors such that no factorization of $w$ is monochromatic?
We show that such a coloring always exists whenever $w$ is a Sturmian
word or a standard episturmian word.



1 Introduction 

Ramsey theory (including Van der Waerden's theorem) (see [5]) is a topic of great
interest in combinatorics with connections to various fields of mathematics. A
remarkable consequence of Ramsey's Infinitary Theorem applied to combinatorics
on words yields the following unavoidable regularity of infinite words (see footnote 1):

Theorem 1. Let $A$ be a non-empty alphabet, $w$ be an infinite word over $A$, $C$
a finite non-empty set (the set of colors), and $c : \mathrm{Fact}^+ w \to C$ any coloring of
the set $\mathrm{Fact}^+ w$ of all non-empty factors of $w$. Then there exists a factorization
of $w$ of the form $w = V U_1 U_2 \cdots U_n \cdots$ such that for all positive integers $i$ and
$j$, $c(U_i) = c(U_j)$.

One can ask whether given an infinite word there exists a suitable coloring 
map able to avoid the monochromaticity of all factors in all factorizations of the 
word. More precisely, the following variant of Theorem 1 was posed as a question 
by T.C. Brown [3] and, independently, by the third author [11]: 

Question 1. Let $w$ be an aperiodic infinite word over a finite alphabet $A$. Does
there exist a finite coloring $c : \mathrm{Fact}^+ w \to C$ with the property that for any
factoring $w = U_1 U_2 \cdots U_n \cdots$, there exist positive integers $i, j$ for which
$c(U_i) \ne c(U_j)$?



Footnote 1. Actually, the proof of Theorem 1 given by Schützenberger in [10] does
not use Ramsey's theorem.



Let us observe that for periodic words the answer to the preceding question
is trivially negative. Indeed, let $w = U^\omega$, and let $c : \mathrm{Fact}^+ w \to C$ be any finite
coloring. By factoring $w$ as $w = U_1 U_2 \cdots U_n \cdots$, where $U_i = U$ for all $i \ge 1$, one
has $c(U_i) = c(U_j)$ for all positive integers $i$ and $j$. It is easy to see that there exist
non-recurrent infinite words $w$ and finite colorings such that for any factoring
$w = U_1 U_2 \cdots U_n \cdots$ there exist $i \ne j$ for which $c(U_i) \ne c(U_j)$. For instance,
consider the infinite word $w = ab^\omega$ and define the coloring map as follows: for
any non-empty factor $U$ of $w$, $c(U) = 1$ if it contains $a$ and $c(U) = 0$ otherwise.
Then for any factoring $w = U_1 U_2 \cdots U_n \cdots$, $c(U_1) = 1$ and $c(U_i) = 0$ for all
$i > 1$.
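The coloring of $w = ab^\omega$ described above can be checked on a finite prefix. The following is only an illustrative sketch: the helper `random_factoring`, the prefix length 41 and the number of cuts are assumptions made for the example and are not part of the argument.

```python
import random

def color(u):
    """Coloring from the text: 1 if the factor contains the letter a, else 0."""
    return 1 if "a" in u else 0

def random_factoring(w, rng, cuts=5):
    """Cut the finite word w into non-empty consecutive blocks (hypothetical helper)."""
    positions = sorted(rng.sample(range(1, len(w)), k=cuts))
    bounds = [0] + positions + [len(w)]
    return [w[i:j] for i, j in zip(bounds, bounds[1:])]

w = "a" + "b" * 40                      # finite prefix of a b^omega
blocks = random_factoring(w, random.Random(0))
colors = [color(u) for u in blocks]
print(colors)                           # first block has color 1, all others 0
```

Whatever the cut positions are, only the first block contains the letter a, so the factoring is never monochromatic.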

It is not very difficult to prove that there exist infinite recurrent words for 
which Question 1 has a positive answer, for instance square-free, overlap-free 
words, and standard Sturmian words [11]. 

In this paper we show that Question 1 has a positive answer for every Stur- 
mian word where the number of colors is equal to 3. This solves a problem raised 
in [3] and in [11]. The proof requires some noteworthy new combinatorial prop- 
erties of Sturmian words. Moreover, we prove that the same result holds true for 
aperiodic standard episturmian words by using a number of colors equal to the 
number of distinct letters occurring in the word plus one. 

For all definitions and notation not explicitly given in the paper, the reader
is referred to the books of M. Lothaire [7, 8]; for Sturmian words see [8, Chap. 2]
and for episturmian words see [4, 6] and the survey of J. Berstel [1].

2 Sturmian words 

There exist several equivalent definitions of Sturmian words. In particular, we
recall (see, for instance, Theorem 2.1.5 of [8]) that an infinite word $s \in \{a, b\}^\omega$
is Sturmian if and only if it is aperiodic and balanced, i.e., for all factors $u$ and
$v$ of $s$ such that $|u| = |v|$ one has:

$$\bigl|\, |u|_x - |v|_x \,\bigr| \le 1, \quad x \in \{a, b\},$$

where $|u|_x$ denotes the number of occurrences of the letter $x$ in $u$. Since a
Sturmian word $s$ is aperiodic, it must have at least one of the two factors $aa$ and
$bb$. However, from the balance property, it follows that a Sturmian word cannot
have both the factors $aa$ and $bb$.
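The balance condition can be tested experimentally on a finite prefix of a concrete Sturmian word. The sketch below uses the Fibonacci word, generated here as the fixed point of the morphism a -> ab, b -> a; the function names, the prefix length 200 and the bound 20 on factor lengths are assumptions chosen for illustration. On a binary alphabet it suffices to compare the numbers of occurrences of a.

```python
def fibonacci_prefix(n):
    """Prefix of length n of the Fibonacci word (fixed point of a -> ab, b -> a)."""
    w = "a"
    while len(w) < n:
        w = "".join("ab" if ch == "a" else "a" for ch in w)
    return w[:n]

def factors(w, k):
    """Set of all factors of length k of the finite word w."""
    return {w[i:i + k] for i in range(len(w) - k + 1)}

def is_balanced(w, max_len):
    """Check ||u|_a - |v|_a| <= 1 for all factors u, v of w with |u| = |v| <= max_len."""
    for k in range(1, max_len + 1):
        counts = {u.count("a") for u in factors(w, k)}
        if max(counts) - min(counts) > 1:
            return False
    return True

w = fibonacci_prefix(200)
print(is_balanced(w, 20))        # balance holds on this sample
print("aa" in w, "bb" in w)      # the Fibonacci word is of type a
```

A finite check of this kind cannot prove that an infinite word is balanced; it can only exhibit a violation, as for the word aabb.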

Definition 1. We say that a Sturmian word is of type a (resp. b) if it does not 
contain the factor bb (resp. aa). 

We recall that a factor $u$ of a finite or infinite word $w$ over the alphabet $A$ is
called right special (resp. left special) if there exist two different letters $x, y \in A$
such that $ux, uy$ (resp. $xu, yu$) are factors of $w$.

A different equivalent definition of a Sturmian word is the following: A binary
infinite word $s$ is Sturmian if for every integer $n \ge 0$, $s$ has a unique left (or
equivalently right) special factor of length $n$. It follows from this that $s$ is closed
under reversal, i.e., if $u$ is a factor of $s$ so is its reversal $u^{\sim}$.



A Sturmian word $s$ is called standard (or characteristic) if all its prefixes are
left special factors of $s$. As is well known, for any Sturmian word $s$ there exists
a standard Sturmian word $t$ such that $\mathrm{Fact}\, s = \mathrm{Fact}\, t$, where for any finite or
infinite word $w$, $\mathrm{Fact}\, w$ denotes the set of all its factors including the empty
word.

Definition 2. Let $s \in \{a, b\}^\omega$ be a Sturmian word. A non-empty factor $w$ of $s$
is rich in the letter $z \in \{a, b\}$ if there exists a factor $v$ of $s$ such that $|v| = |w|$
and $|w|_z > |v|_z$.

From the aperiodicity and the balance property of a Sturmian word one easily
derives that any non-empty factor $w$ of a Sturmian word $s$ is rich either in the
letter $a$ or in the letter $b$ but not in both letters. Thus one can introduce for any
given Sturmian word $s$ a map

$$r_s : \mathrm{Fact}^+ s \to \{a, b\}$$

defined as: for any non-empty factor $w$ of $s$, $r_s(w) = z \in \{a, b\}$ if $w$ is rich in the
letter $z$. Clearly, $r_s(w) = r_s(w^{\sim})$ for any $w \in \mathrm{Fact}^+ s$.

For any letter $z \in \{a, b\}$ we shall denote by $\bar{z}$ the complementary letter of $z$,
i.e., $\bar{a} = b$ and $\bar{b} = a$.

Lemma 1. Let $w$ be a non-empty right special (resp. left special) factor of a
Sturmian word $s$. Then $r_s(w)$ is equal to the first letter of $w$ (resp. $r_s(w)$ is
equal to the last letter of $w$).

Proof. Write $w = zw'$ with $z \in \{a, b\}$ and $w' \in \{a, b\}^*$. Since $w$ is a right special
factor of $s$, the word $w\bar{z} = zw'\bar{z}$ is a factor of $s$, so that $v = w'\bar{z}$ is a factor of $s$.
Thus $|w| = |v|$ and $|w|_z > |v|_z$, whence $r_s(w) = z$. Similarly, if $w$ is left special
one deduces that $r_s(w)$ is equal to the last letter of $w$. □
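Lemma 1 can be illustrated on the Fibonacci word, which is a standard Sturmian word, so that each of its prefixes is left special and should be rich in its last letter. The sketch below estimates the map $r_s$ from a finite sample; the function `richness`, the sample length 200 and the range of prefix lengths are assumptions made for the example, and the estimate is only meaningful when the sample contains all factors of the length considered.

```python
def fibonacci_prefix(n):
    """Prefix of length n of the Fibonacci word (fixed point of a -> ab, b -> a)."""
    w = "a"
    while len(w) < n:
        w = "".join("ab" if ch == "a" else "a" for ch in w)
    return w[:n]

def richness(w, sample):
    """Letter in which the factor w is rich, judged against the factors of sample."""
    k = len(w)
    counts = {sample[i:i + k].count("a") for i in range(len(sample) - k + 1)}
    # For a Sturmian word exactly two consecutive values of |u|_a occur per length.
    if len(counts) != 2:
        raise ValueError("sample too short (or not balanced) for this length")
    return "a" if w.count("a") == max(counts) else "b"

sample = fibonacci_prefix(200)
# Each prefix is left special, hence rich in its last letter (Lemma 1).
prefix_ok = all(richness(sample[:k], sample) == sample[k - 1] for k in range(1, 11))
print(prefix_ok)
```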

3 Preliminary Lemmas 

Lemma 2. Let $s$ be a Sturmian word such that

$$s = \prod_{i \ge 1} U_i,$$

where the $U_i$'s are non-empty factors of $s$. If for every $i$ and $j$, $r_s(U_i) = r_s(U_j)$,
then for any $M > 0$ there exists an integer $i$ such that $|U_i| > M$.

Proof. Suppose to the contrary that for some positive integer $M$ we have that
$|U_i| \le M$ for each $i \ge 1$. This implies that the number of distinct $U_i$'s in the
sequence $(U_i)_{i \ge 1}$ is finite, say $t$. Let $r_s(U_i) = x \in \{a, b\}$ for all $i \ge 1$ and set for
each $i \ge 1$:

$$f_i = \frac{|U_i|_x}{|U_i|}.$$

Thus $\{f_i \mid i \ge 1\}$ is a finite set of at most $t$ rational numbers. We set
$r = \min\{f_i \mid i \ge 1\}$.

Let $f_x(s)$ be the frequency of the letter $x$ in $s$ defined as

$$f_x(s) = \lim_{n \to \infty} \frac{|s_{[n]}|_x}{n},$$

where for every $n \ge 1$, $s_{[n]}$ denotes the prefix of $s$ of length $n$. As is well known
(see Prop. 2.1.11 of [8]), $f_x(s)$ exists and is an irrational number.

Let us now prove that $r > f_x(s)$. From Proposition 2.1.10 in [8] one derives
that for all $V \in \mathrm{Fact}\, s$

$$|V| f_x(s) - 1 < |V|_x < |V| f_x(s) + 1.$$

Now for any $i \ge 1$, $U_i$ is rich in the letter $x$, so that there exists $V_i \in \mathrm{Fact}\, s$ such
that $|U_i| = |V_i|$ and $|U_i|_x > |V_i|_x$. From the preceding inequality one has:

$$|U_i|_x = |V_i|_x + 1 > |V_i| f_x(s) = |U_i| f_x(s),$$

so that for all $i \ge 1$, $f_i > f_x(s)$, hence $r > f_x(s)$.

For any $n > 0$, we can write the prefix $s_{[n]}$ of length $n$ as:

$$s_{[n]} = U_1 \cdots U_k U'_{k+1},$$

for a suitable $k \ge 0$ and $U'_{k+1}$ a prefix of $U_{k+1}$. Thus

$$|s_{[n]}|_x = \sum_{i=1}^{k} |U_i|_x + |U'_{k+1}|_x.$$

Since $|U_i|_x = f_i |U_i| \ge r |U_i|$ and $|U'_{k+1}| \le M$, one has

$$|s_{[n]}|_x \ge r \sum_{i=1}^{k} |U_i| = r\,(n - |U'_{k+1}|) \ge rn - rM.$$

Thus

$$\frac{|s_{[n]}|_x}{n} \ge r - \frac{rM}{n}$$

and

$$f_x(s) = \lim_{n \to \infty} \frac{|s_{[n]}|_x}{n} \ge r,$$

a contradiction. □

In the following we shall consider the Sturmian morphism $R_a$, which we simply
denote by $R$, defined as:

$$R(a) = a \quad \text{and} \quad R(b) = ba. \tag{1}$$

For any finite or infinite word w, Pref w will denote the set of all its prefixes. 
The following holds: 



Lemma 3. Let $s$ be a Sturmian word and $t \in \{a, b\}^\omega$ such that $R(t) = s$. If
either

1) the first letter of $t$ (or, equivalently, of $s$) is $b$

or

2) the Sturmian word $s$ admits a factorization:

$$s = U_1 \cdots U_n \cdots,$$

where each $U_i$, $i \ge 1$, is a non-empty prefix of $s$ terminating in the letter $a$
and $r_s(U_i) = r_s(U_j)$ for all $i, j \ge 1$,

then $t$ is also Sturmian.

Proof. Let us prove that in both cases $t$ is balanced. Suppose to the contrary
that $t$ is unbalanced. Then (see Prop. 2.1.3 of [8]) there would exist a word $v$ such that

$$ava, \; bvb \in \mathrm{Fact}\, t.$$

Thus

$$aR(v)a, \; baR(v)ba \in \mathrm{Fact}\, s.$$

If $ava \notin \mathrm{Pref}\, t$, then $t = \lambda\, ava\, \mu$, with $\lambda \in \{a, b\}^+$ and $\mu \in \{a, b\}^\omega$.
Therefore $R(t) = R(\lambda) R(ava) R(\mu)$. Since the last letter of $R(\lambda)$ is $a$, it follows that
$aaR(v)a \in \mathrm{Fact}\, s$. As $baR(v)b \in \mathrm{Fact}\, s$ we reach a contradiction with the
balance property of $s$. In case 1), $t$ begins in the letter $b$, so that $ava \notin \mathrm{Pref}\, t$
and then $t$ is balanced. In case 2) suppose that $ava \in \mathrm{Pref}\, t$. This implies that
$aR(v)a \in \mathrm{Pref}\, s$. From the preceding lemma (Lemma 2), in the factorization of $s$ in
prefixes there exists an integer $i > 1$ such that $|U_i| > |aR(v)a|$. Since $U_{i-1}$ terminates
in $a$ and $U_{i-1}U_i \in \mathrm{Fact}\, s$, it follows that $aaR(v)a \in \mathrm{Fact}\, s$ and one contradicts
again the balance property of $s$. Hence, $t$ is balanced.

Trivially, in both cases $t$ is aperiodic, so that $t$ is Sturmian. □

Let us remark that, in general, without any additional hypothesis, if $s = R(t)$,
then $t$ need not be Sturmian. For instance, if $f$ is the Fibonacci word
$f = abaababaaba\cdots$, then $s = af$ is also a Sturmian word. However, it is readily verified
that in this case the word $t$ such that $R(t) = s$ is not balanced, so that $t$ is not
Sturmian.
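The remark on $s = af$ can be verified by hand on a short prefix, or with the following sketch, which decodes a finite word over the prefix code $\{a, ba\}$. The function names `R` and `inverse_R` and the choice of the prefix $abaababaaba$ of $f$ are assumptions made for the example.

```python
def R(word):
    """The morphism R of (1): a -> a, b -> ba."""
    return "".join("a" if ch == "a" else "ba" for ch in word)

def inverse_R(s):
    """Decode a finite word as a product of blocks a and ba.

    Decoding stops at the first position where no block fits
    (a trailing b, or an occurrence of bb)."""
    t, i = [], 0
    while i < len(s):
        if s[i] == "a":
            t.append("a")
            i += 1
        elif i + 1 < len(s) and s[i + 1] == "a":
            t.append("b")
            i += 2
        else:
            break
    return "".join(t)

s_prefix = "a" + "abaababaaba"      # prefix of s = af
t_prefix = inverse_R(s_prefix)
print(t_prefix)                      # aababbab
print("aa" in t_prefix, "bb" in t_prefix)
```

The decoded prefix contains both $aa$ and $bb$, so $t$ cannot be balanced.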

For any finite or infinite word $w$ over the alphabet $A$, $\mathrm{alph}\, w$ denotes the
set of all distinct letters of $A$ occurring in $w$. We will make use of the following
lemma:

Lemma 4. Let $s$ be a Sturmian word having a factorization

$$s = U_1 \cdots U_n \cdots,$$

where for $i \ge 1$, $U_i$ are non-empty prefixes of $s$. Then for any $p \ge 1$, $U_1 \ne c^p$,
where $c$ is the first letter of $s$.

Proof. Suppose that $U_1 = c^p$. Since $s$ is aperiodic there exists a minimal
integer $j$ such that $\mathrm{card}(\mathrm{alph}\, U_j) = 2$. Since $U_j$ is a prefix of $s$, one has then
$U_1 \cdots U_{j-1} U_j = U_j \xi$, with $\xi \in \{a, b\}^*$. As $U_1 \cdots U_{j-1} = c^q$ for a suitable $q \ge p$,
it follows that $\xi = c^q$ and $U_j \in cc^*$, a contradiction. □



4 Main results 



Proposition 1. Let $s$ be a Sturmian word of type $a$ having a factorization

$$s = U_1 \cdots U_n \cdots,$$

where for $i \ge 1$, $U_i$ are non-empty prefixes of $s$ such that $r_s(U_i) = r_s(U_j)$ for all
$i, j \ge 1$. Then one of the following two properties holds:

i) All $U_i$, $i \ge 1$, terminate in the letter $a$.
ii) For all $i \ge 1$, $U_i a \in \mathrm{Pref}\, s$.

Proof. Let us first suppose that $s$ begins in the letter $b$. All prefixes $U_i$, $i \ge 1$,
of $s$ begin in the letter $b$ and, as $s$ is of type $a$, have to terminate in the letter $a$.
Thus in this case Property i) is satisfied.

Let us then suppose that $s$ begins in the letter $a$. Now either all prefixes $U_i$,
$i \ge 1$, terminate in the letter $a$, or all prefixes $U_i$, $i \ge 1$, terminate in the letter
$b$, or some of the prefixes terminate in the letter $a$ and some in the letter $b$. We
have then to consider the following cases:

Case 1. All prefixes $U_i$, $i \ge 1$, terminate in the letter $b$.

Since $s$ is of type $a$, none of the prefixes $U_i$, $i \ge 1$, can be a right special
factor. This implies that $U_i a \in \mathrm{Pref}\, s$ and Property ii) is satisfied.

Case 2. Some of the prefixes $U_i$, $i \ge 1$, terminate in the letter $a$ and some in the
letter $b$.

We have to consider two subcases:

a) $r_s(U_i) = b$, for all $i \ge 1$.

As all $U_i$, $i \ge 1$, begin in $a$, if any $U_i$ were right special, then by Lemma 1,
$r_s(U_i) = a$, a contradiction. It follows that for all $i \ge 1$, $U_i a \in \mathrm{Pref}\, s$.

b) $r_s(U_i) = a$, for all $i \ge 1$.

Some of the prefixes $U_j$, $j \ge 1$, terminate in $a$ (since otherwise we are in
Case 1). Let $U_k$ be a prefix terminating in $a$ for a suitable $k \ge 1$. If a prefix
$U_i$ terminates in $b$, then $aU_i$ is not a factor of $s$. Indeed, otherwise, the word
$aU_i b^{-1}$ is such that $|aU_i b^{-1}| = |U_i|$ and $|aU_i b^{-1}|_b < |U_i|_b$, so that $r_s(U_i) = b$, a
contradiction. Thus one derives that all $U_l$ with $l > k$ terminate in $a$. Moreover,
if some $U_i$ terminates in $b$, by Lemma 2 there exists $j > k$ such that $U_j$ has the
prefix $U_i$, so that $U_{j-1}U_i \in \mathrm{Fact}\, s$. Since $U_{j-1}$ terminates in $a$, one has that $aU_i$
is a factor of $s$, a contradiction. Thus all $U_i$, $i \ge 1$, terminate in $a$. □

Proposition 2. Let $s$ be a Sturmian word having a factorization

$$s = U_1 \cdots U_n \cdots,$$

where for $i \ge 1$, $U_i$ are non-empty prefixes of $s$ such that $r_s(U_i) = r_s(U_j)$ for all
$i, j \ge 1$. Then there exists a Sturmian word $t$ such that

$$t = V_1 \cdots V_n \cdots,$$

where for all $i \ge 1$, $V_i$ are non-empty prefixes of $t$, $r_t(V_i) = r_t(V_j)$ for all $i, j \ge 1$,
and $|V_1| < |U_1|$.



Proof. We can suppose without loss of generality that $s$ is a Sturmian word of
type $a$. From Proposition 1 either all $U_i$, $i \ge 1$, terminate in the letter $a$ or for
all $i \ge 1$, $U_i a \in \mathrm{Pref}\, s$. We consider two cases:

Case 1. For all $i \ge 1$, $U_i a \in \mathrm{Pref}\, s$.

We can suppose that $s$ begins in the letter $a$. Indeed, otherwise, if the first
letter of $s$ is $b$, then all $U_i$, $i \ge 1$, begin in the letter $b$ and, as $s$ is of type $a$, they
have to terminate in the letter $a$. Thus the case that the first letter of $s$ is $b$ will
be considered when we analyze Case 2.

We consider the injective endomorphism of $\{a, b\}^*$, $L_a$, or simply $L$, defined
by

$$L(a) = a \quad \text{and} \quad L(b) = ab.$$

Since $s$ is of type $a$, the first letter of $s$ is $a$, and $X = \{a, ab\}$ is a code having
a finite deciphering delay (cf. [2]), the word $s$ can be uniquely factorized by the
elements of $X$. Thus there exists a unique word $t \in \{a, b\}^\omega$ such that $s = L(t)$.
The following holds:

1. The word $t$ is a Sturmian word.

2. For any $i \ge 1$ there exists a non-empty prefix $V_i$ of $t$ such that $L(V_i) = U_i$.

3. The word $t$ can be factorized as $t = V_1 \cdots V_n \cdots$.

4. $|V_1| < |U_1|$.

5. For all $i, j \ge 1$, $r_t(V_i) = r_t(V_j)$.

Point 1. This is a consequence of the fact that $L$ is a standard Sturmian morphism
(see Corollary 2.3.3 in Chap. 2 of [8]).

Point 2. For any $i \ge 1$, since $U_i a \in \mathrm{Pref}\, s$ and any pair $(c, a)$ with $c \in \{a, b\}$ is
synchronizing for $X^\infty = X^* \cup X^\omega$ (cf. [2]), one has that $U_i \in X^*$, so that there
exists $V_i \in \mathrm{Pref}\, t$ such that $L(V_i) = U_i$.

Point 3. One has $L(V_1 \cdots V_n \cdots) = U_1 \cdots U_n \cdots = s = L(t)$. Thus $t = V_1 \cdots V_n \cdots$.

Point 4. By Lemma 4, $U_1$ is not a power of $a$ so that in $U_1$ there must be at
least one occurrence of the letter $b$. This implies that $|V_1| < |U_1|$.

Point 5. We shall prove that for all $i \ge 1$, $r_t(V_i) = r_s(U_i)$. From this one has
that for all $i, j \ge 1$, $r_t(V_i) = r_t(V_j)$.

Since $t$ is a Sturmian word, there exists $V'_i \in \mathrm{Fact}\, t$ such that

$$|V_i| = |V'_i| \quad \text{and either} \quad |V_i|_a > |V'_i|_a \quad \text{or} \quad |V_i|_a < |V'_i|_a.$$

In the first case $r_t(V_i) = a$ and in the second case $r_t(V_i) = b$. Let us set

$$F_i = L(V'_i).$$

Since $U_i = L(V_i)$, from the definition of the morphism $L$ one has:

$$|F_i|_a = |V'_i|_a + |V'_i|_b = |V'_i|, \qquad |F_i|_b = |V'_i|_b. \tag{2}$$

$$|U_i|_a = |V_i|_a + |V_i|_b = |V_i|, \qquad |U_i|_b = |V_i|_b. \tag{3}$$

Let us first consider the case $r_t(V_i) = a$, i.e., $|V_i|_a = |V'_i|_a + 1$ and
$|V_i|_b = |V'_i|_b - 1$. From the preceding equations one has:

$$|F_i| = |U_i| + 1.$$

Moreover, from the definition of $L$ one has that $F_i$ begins in the letter $a$. Hence,
$|a^{-1}F_i| = |U_i|$ and $|a^{-1}F_i|_a = |F_i|_a - 1 = |U_i|_a - 1$. Thus $|U_i|_a > |a^{-1}F_i|_a$. Since
$a^{-1}F_i \in \mathrm{Fact}\, s$, one has

$$r_s(U_i) = r_t(V_i) = a.$$

Let us now consider the case $r_t(V_i) = b$, i.e., $|V_i|_a = |V'_i|_a - 1$ and $|V_i|_b = |V'_i|_b + 1$.
From (2) and (3) one derives:

$$|U_i| = |F_i| + 1,$$

and $|U_i|_b > |F_i|_b$. Now $F_i a$ is a factor of $s$. Indeed, $F_i = L(V'_i)$ and for any
$c \in \{a, b\}$ such that $V'_i c \in \mathrm{Fact}\, t$ one has $L(V'_i c) = F_i L(c)$. Since for any letter
$c$, $L(c)$ begins in the letter $a$ it follows that $F_i a \in \mathrm{Fact}\, s$. Since $|F_i a| = |U_i|$ and
$|U_i|_b > |F_i|_b = |F_i a|_b$, one has that $U_i$ is rich in $b$. Hence, $r_s(U_i) = r_t(V_i) = b$.

Case 2. All $U_i$, $i \ge 1$, terminate in the letter $a$.

We consider the injective endomorphism of $\{a, b\}^*$, $R_a$, or simply $R$, defined
in (1). Since $s$ is of type $a$ and $X = \{a, ba\}$ is a prefix code, the word $s$ can
be uniquely factorized by the elements of $X$. Thus there exists a unique word
$t \in \{a, b\}^\omega$ such that $s = R(t)$. The following holds:

1. The word $t$ is a Sturmian word.

2. For any $i \ge 1$ there exists a non-empty prefix $V_i$ of $t$ such that $R(V_i) = U_i$.

3. The word $t$ can be factorized as $t = V_1 \cdots V_n \cdots$.

4. $|V_1| < |U_1|$.

5. For all $i, j \ge 1$, $r_t(V_i) = r_t(V_j)$.

Point 1. From Lemma 3, since $R(t) = s$ it follows that $t$ is Sturmian.

Point 2. For any $i \ge 1$, since $U_i$ terminates in the letter $a$ and any pair $(a, c)$
with $c \in \{a, b\}$ is synchronizing for $X^\infty$, one has that $U_i \in X^*$, so that there
exists $V_i \in \mathrm{Pref}\, t$ such that $R(V_i) = U_i$.

Point 3. One has $R(V_1 \cdots V_n \cdots) = U_1 \cdots U_n \cdots = s = R(t)$. Thus $t = V_1 \cdots V_n \cdots$.

Point 4. By Lemma 4, $U_1$ is not a power of the first letter $c$ of $s$, so that in $U_1$
there must be at least one occurrence of the letter $b$ (this is immediate if $c = b$,
and if $c = a$ it holds since $U_1$ is not a power of $a$). This implies that $|V_1| < |U_1|$.

Point 5. We shall prove that for all $i \ge 1$, $r_t(V_i) = r_s(U_i)$. From this one has
that for all $i, j \ge 1$, $r_t(V_i) = r_t(V_j)$.

Since $t$ is a Sturmian word, there exists $V'_i \in \mathrm{Fact}\, t$ such that

$$|V_i| = |V'_i| \quad \text{and either} \quad |V_i|_a > |V'_i|_a \quad \text{or} \quad |V_i|_a < |V'_i|_a.$$

In the first case $r_t(V_i) = a$ and in the second case $r_t(V_i) = b$. Let us set

$$F_i = R(V'_i).$$

Since $U_i = R(V_i)$, from the definition of the morphism $R$ one has that equations
(2) and (3) are satisfied.

Let us first consider the case $r_t(V_i) = a$, i.e., $|V_i|_a = |V'_i|_a + 1$ and
$|V_i|_b = |V'_i|_b - 1$. From the preceding equations one has:

$$|F_i| = |U_i| + 1.$$

From the definition of the morphism $R$ one has that $F_i = R(V'_i)$ terminates in
the letter $a$. Hence, $|F_i a^{-1}| = |U_i|$ and $|F_i a^{-1}|_a = |F_i|_a - 1 = |U_i|_a - 1$. Thus
$|U_i|_a = |F_i a^{-1}|_a + 1$, so that $U_i$ is rich in $a$ and $r_s(U_i) = r_t(V_i) = a$.

Let us now suppose that $r_t(V_i) = b$, i.e., $|V_i|_a = |V'_i|_a - 1$ and $|V_i|_b = |V'_i|_b + 1$.
From (2) and (3) one derives:

$$|U_i| = |F_i| + 1,$$

and $|U_i|_b > |F_i|_b$. We prove that $aF_i \in \mathrm{Fact}\, s$. Indeed, $F_i = R(V'_i)$ and for any
$c \in \{a, b\}$ such that $cV'_i \in \mathrm{Fact}\, t$ one has $R(cV'_i) = R(c)R(V'_i) = R(c)F_i$. Note that
such a letter $c$ always exists as $t$ is recurrent. Since for any letter $c$, $R(c)$ terminates in the
letter $a$ it follows that $aF_i \in \mathrm{Fact}\, s$. Since $|aF_i| = |U_i|$ and $|U_i|_b > |aF_i|_b = |F_i|_b$,
one has that $U_i$ is rich in $b$. Hence, $r_s(U_i) = r_t(V_i) = b$. □

Theorem 2. Let $s$ be a Sturmian word having a factorization

$$s = U_1 \cdots U_n \cdots,$$

where each $U_i$, $i \ge 1$, is a non-empty prefix of $s$. Then there exist integers $i, j \ge 1$
such that $r_s(U_i) \ne r_s(U_j)$.

Proof. Let $s$ be a Sturmian word and suppose that $s$ admits a factorization

$$s = U_1 \cdots U_n \cdots,$$

where for $i \ge 1$, $U_i$ are non-empty prefixes such that for all $i, j \ge 1$,
$r_s(U_i) = r_s(U_j)$. Among all Sturmian words having this property we can always consider
a Sturmian word $s$ such that $|U_1|$ is minimal. Without loss of generality we can
suppose that $s$ is of type $a$. By Proposition 2 there exists a Sturmian word $t$
such that

$$t = V_1 \cdots V_n \cdots,$$

where for all $i \ge 1$, $V_i$ are non-empty prefixes, $r_t(V_i) = r_t(V_j)$ for all $i, j \ge 1$,
and $|V_1| < |U_1|$, which contradicts the minimality of the length of $U_1$. □

Theorem 3. Let $s$ be a Sturmian word. There exists a coloring $c$ of the non-empty
factors of $s$, $c : \mathrm{Fact}^+ s \to \{0, 1, 2\}$, such that for any factorization

$$s = V_1 \cdots V_n \cdots$$

in non-empty factors $V_i$, $i \ge 1$, there exist integers $i, j$ such that $c(V_i) \ne c(V_j)$.

Proof. Let us define the coloring $c$ as: for any $V \in \mathrm{Fact}^+ s$

$$c(V) = \begin{cases} 0 & \text{if } V \text{ is not a prefix of } s, \\ 1 & \text{if } V \text{ is a prefix of } s \text{ and } r_s(V) = a, \\ 2 & \text{if } V \text{ is a prefix of } s \text{ and } r_s(V) = b. \end{cases}$$

Let us suppose to the contrary that for all $i, j$, $c(V_i) = c(V_j) = x \in \{0, 1, 2\}$. If $x = 0$
we reach a contradiction as $V_1$ is a prefix of $s$ so that $c(V_1) \in \{1, 2\}$. If $x = 1$
or $x = 2$, then all $V_i$ have to be prefixes of $s$ having the same richness, but this
contradicts Theorem 2. □
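The coloring used in the proof of Theorem 3 can be sketched for the Fibonacci word as follows. This is only an approximation computed from a finite sample: the names `richness` and `color`, and the sample length 200, are assumptions made for the example, and the values are reliable only for short factors all of whose same-length competitors occur in the sample.

```python
def fibonacci_prefix(n):
    """Prefix of length n of the Fibonacci word (fixed point of a -> ab, b -> a)."""
    w = "a"
    while len(w) < n:
        w = "".join("ab" if ch == "a" else "a" for ch in w)
    return w[:n]

def richness(w, sample):
    """Letter in which w is rich, judged against the factors of the finite sample."""
    k = len(w)
    counts = {sample[i:i + k].count("a") for i in range(len(sample) - k + 1)}
    return "a" if w.count("a") == max(counts) else "b"

def color(v, sample):
    """Color 0 for non-prefixes; 1 or 2 for prefixes rich in a or in b."""
    if not sample.startswith(v):
        return 0
    return 1 if richness(v, sample) == "a" else 2

sample = fibonacci_prefix(200)
for v in ["a", "ab", "aba", "abaab", "b", "ba"]:
    print(v, color(v, sample))
```

Only three colors are needed because non-prefixes share the color 0, which the first block of any factorization can never receive.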



5 The case of standard episturmian words 

An infinite word $s$ over the alphabet $A$ is called standard episturmian if it is
closed under reversal and every left special factor of $s$ is a prefix of $s$. A word
$s \in A^\omega$ is called episturmian if there exists a standard episturmian word $t \in A^\omega$ such
that $\mathrm{Fact}\, s = \mathrm{Fact}\, t$. We recall the following facts about episturmian words [4,
6]:

Fact 1. Every prefix of an aperiodic standard episturmian word $s$ is a left
special factor of $s$. In particular an aperiodic standard episturmian word on a
two-letter alphabet is a standard Sturmian word.

Fact 2. If $s$ is a standard episturmian word with first letter $a$, then $a$ is
separating, i.e., for any $x, y \in A$ if $xy \in \mathrm{Fact}\, s$, then $a \in \{x, y\}$.

For each $x \in A$, let $L_x$ denote the standard episturmian morphism [6] defined
for any $y \in A$ by $L_x(y) = x$ if $y = x$ and $L_x(y) = xy$ for $x \ne y$.

Fact 3. The infinite word $s \in A^\omega$ is standard episturmian if and only if there
exist a standard episturmian word $t$ and $a \in A$ such that $s = L_a(t)$. Moreover,
$t$ is unique and the first letter of $s$ is $a$.
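Facts 2 and 3 can be illustrated on the Tribonacci word, a well-known standard episturmian word over three letters, which is not discussed in the text and is used here only as an example. It is the fixed point of the morphism a -> ab, b -> ac, c -> a, which coincides with $L_a$ composed with the letter permutation a -> b, b -> c, c -> a. The function names and the lengths 200 and 50 below are assumptions made for the sketch.

```python
def L(x, word):
    """The morphism L_x: x -> x and y -> xy for every letter y != x."""
    return "".join(x if y == x else x + y for y in word)

def tribonacci_prefix(n):
    """Prefix of length n of the Tribonacci word (a -> ab, b -> ac, c -> a)."""
    rules = {"a": "ab", "b": "ac", "c": "a"}
    w = "a"
    while len(w) < n:
        w = "".join(rules[ch] for ch in w)
    return w[:n]

T = tribonacci_prefix(200)

# Fact 3 on a prefix: T = L_a(t), where t is T with letters renamed a->b, b->c, c->a.
t_prefix = T[:50].translate(str.maketrans("abc", "bca"))
print(T.startswith(L("a", t_prefix)))

# Fact 2 on the sample: every factor of length 2 contains the first letter a.
pairs = {T[i:i + 2] for i in range(len(T) - 1)}
print(sorted(pairs), all("a" in p for p in pairs))
```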

The following was proved in [9]:

Fact 4. A recurrent word $w$ over the alphabet $A$ is episturmian if and only if
for each factor $u$ of $w$, a letter $b$ exists (depending on $u$) such that
$AuA \cap \mathrm{Fact}\, w \subseteq buA \cup Aub$.

Definition 3. We say that a standard episturmian word $s$ is of type $a$, $a \in A$,
if the first letter of $s$ is $a$.

Theorem 4. Let $s$ be an aperiodic standard episturmian word over the alphabet
$A$ and let $s = U_1 U_2 \cdots$ be any factoring of $s$ with each $U_i$, $i \ge 1$, a non-empty
prefix of $s$. Then there exist indices $i \ne j$ for which $U_i$ and $U_j$ terminate in a
different letter.

Proof. Suppose to the contrary that there exists an aperiodic standard episturmian
word $s$ over the alphabet $A$ admitting a factorization $s = U_1 U_2 \cdots$ in
which all $U_i$ are non-empty prefixes of $s$ ending in the same letter. Amongst all
aperiodic standard episturmian words over the alphabet $A$ having the preceding
factorization, we may choose one such $s$ for which $|U_1|$ is minimal. Let $a \in A$ be
the first letter of $s$, so that $s$ is of type $a$.

Let us now prove that for every $i \ge 1$, one has that $U_i a$ is a prefix of $s$. Let us
first suppose that for all $i \ge 1$, $U_i$ ends in a letter $x \ne a$. Since $a$ is separating ($s$
is of type $a$), $x$ can be followed only by $a$, so that the prefix $U_i$ can be followed
only by $a$. This implies that $U_i a$ is a prefix of $s$.

Let us then suppose that for all $i \ge 1$, $U_i$ ends in $a$. Since $U_1$ is a prefix of
$s$, and all $U_i$, $i \ge 1$, begin in $a$ one has that $U_1 a$ is a prefix of $s$. Now let $i > 1$.
Since $U_{i-1}$ ends in $a$ it follows that $aU_i a$ is a factor of $s$.

Let $U_i x$ be a prefix of $s$; we want to show that $x = a$. Since $U_i x$ is left special
(as it is a prefix of $s$), there exists a letter $y \ne a$ such that $yU_i x$ is a factor of
$s$. Now from this and by Fact 4, there exists a letter $b$ (depending only on $U_i$)
such that either $x = b$ or $y = b$.

So now, by Fact 4, since $aU_i a$ and $yU_i x$ are both factors of $s$, we deduce
$b = a$ and either $x = a$ or $y = a$. Since $y \ne a$, it follows that $x = a$. Therefore,
we have proved that for every $i \ge 1$, $U_i a$ is a prefix of $s$.

Let us now observe that $U_1$ must contain an occurrence of a letter $x \ne a$.
Indeed, otherwise, suppose that $U_1 = a^k$ and consider the least $i > 1$ such that
a letter $x \ne a$ occurs in $U_i$. This implies, by using an argument similar to that of the proof
of Lemma 4, that $U_i$ cannot be a prefix of $s$.

By Fact 3, one has that there exists a unique standard episturmian word $s'$ such
that $s = L_a(s')$ and $\mathrm{alph}\, s' \subseteq \mathrm{alph}\, s \subseteq A$. Moreover, since $s$ is aperiodic, trivially
one has that also $s'$ is aperiodic.

Let us observe that the set $X = \{a\} \cup \{ax \mid x \in A\}$ is a code having
deciphering delay equal to 1 and that any pair $(x, a)$ with $x \in A$ is synchronizing for
$X^\infty$. This implies that $s$ can be uniquely factored by the words of $X$. Moreover,
since $U_i a$ is a prefix of $s$, from the synchronization property of $X^\infty$, it follows
that for each $i \ge 1$,

$$U_i = L_a(U'_i),$$

where $U'_i$ is a prefix of $s'$. From the definition of $L_a$ and the preceding formula,
one has that the last letter of $U_i$ is equal to the last letter of $U'_i$.
Moreover,

$$L_a(U'_1 \cdots U'_n \cdots) = U_1 \cdots U_n \cdots = s = L_a(s').$$

Thus $s' = U'_1 \cdots U'_n \cdots$, where each $U'_i$, $i \ge 1$, is a non-empty prefix of $s'$ and for
all $i, j \ge 1$, $U'_i$ and $U'_j$ terminate in the same letter. Since in $U_1 = L_a(U'_1)$ there
is an occurrence of a letter different from $a$ one obtains that $|U'_1| < |U_1|$, which
is a contradiction. □

Let us observe that in the case of a standard Sturmian word, Theorem 4 is
an immediate consequence of Theorem 2 and Lemma 1.

Theorem 5. Let $s$ be an aperiodic standard episturmian word and let
$k = \mathrm{card}(\mathrm{alph}\, s)$. There exists a coloring $c$ of the non-empty factors of $s$,
$c : \mathrm{Fact}^+ s \to \{0, 1, \ldots, k\}$, such that for any factorization

$$s = V_1 \cdots V_n \cdots$$

in non-empty factors $V_i$, $i \ge 1$, there exist integers $i, j$ such that $c(V_i) \ne c(V_j)$.

Proof. Let $\mathrm{alph}\, s = \{a_1, \ldots, a_k\}$. We define the coloring $c$ as: for any $V \in \mathrm{Fact}^+ s$

$$c(V) = \begin{cases} 0 & \text{if } V \text{ is not a prefix of } s, \\ 1 & \text{if } V \text{ is a prefix of } s \text{ terminating in } a_1, \\ \;\vdots & \\ k & \text{if } V \text{ is a prefix of } s \text{ terminating in } a_k. \end{cases}$$

Let us suppose by contradiction that for all $i, j$, $c(V_i) = c(V_j) = x \in \{0, 1, \ldots, k\}$.
If $x = 0$ we reach a contradiction as $V_1$ is a prefix of $s$ so that $c(V_1) \in \{1, \ldots, k\}$.
If $x \in \{1, \ldots, k\}$, then all $V_i$ have to be prefixes of $s$ terminating in the same
letter, but this contradicts Theorem 4. □
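The coloring of Theorem 5 is straightforward to compute on a finite sample. The sketch below uses the Tribonacci word (fixed point of a -> ab, b -> ac, c -> a) as an assumed example of an aperiodic standard episturmian word with $k = 3$; the function names and the sample length are illustrative choices, and the argument `v` is assumed to be a non-empty word no longer than the sample.

```python
def tribonacci_prefix(n):
    """Prefix of length n of the Tribonacci word (a -> ab, b -> ac, c -> a)."""
    rules = {"a": "ab", "b": "ac", "c": "a"}
    w = "a"
    while len(w) < n:
        w = "".join(rules[ch] for ch in w)
    return w[:n]

def make_coloring(sample):
    """Return c: 0 for non-prefixes, j for prefixes ending in the j-th letter."""
    letters = sorted(set(sample))                 # plays the role of a_1, ..., a_k
    index = {x: j + 1 for j, x in enumerate(letters)}
    def c(v):
        if not sample.startswith(v):
            return 0
        return index[v[-1]]
    return c

c = make_coloring(tribonacci_prefix(200))
for v in ["a", "ab", "aba", "abac", "ba", "ca"]:
    print(v, c(v))
```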